Offline Quantum Reinforcement Learning in a Conservative Manner

Abstract

Recently, to reap the quantum advantage, empowering reinforcement learning (RL) with quantum computing has attracted much attention; this approach is dubbed quantum RL (QRL). However, current QRL algorithms employ an online learning scheme, i.e., the policy that runs on a quantum computer needs to interact with the environment to collect experiences, which could be expensive and dangerous for practical applications. In this paper, we aim to solve this problem in an offline learning manner. To be more specific, we develop the first offline quantum RL (offline QRL) algorithm, named CQ2L (Conservative Quantum Q-learning), which learns from offline samples and does not require any interaction with the environment. CQ2L utilizes variational quantum circuits (VQCs), which are improved with data re-uploading and scaling parameters, to represent the Q-value functions of agents. To suppress the overestimation of Q-values resulting from offline data, we first employ a double Q-learning framework to reduce the overestimation bias; then a penalty term that encourages generating conservative Q-values is designed. We conduct abundant experiments to demonstrate that the proposed method CQ2L can successfully solve offline QRL tasks that the online counterpart could not.
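The two ingredients named in the abstract, a double Q-learning target and a penalty that pushes Q-values to be conservative, can be illustrated with a small classical sketch. This is not the authors' implementation: the abstract does not give the exact form of the penalty, so a CQL-style log-sum-exp term is assumed here, and the double Q-learning target is assumed to follow the usual pattern of selecting the action with the online Q-function and evaluating it with a target Q-function. The names `conservative_double_q_loss`, `q_online`, `q_target`, `gamma`, and `alpha` are hypothetical, and plain Python callables stand in for the VQC-based Q-functions.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def conservative_double_q_loss(q_online, q_target, batch, gamma=0.99, alpha=1.0):
    """Mean loss over an offline batch of (s, a, r, s_next, done) tuples.

    q_online / q_target map a state to a list of Q-values, one per action.
    They are hypothetical stand-ins for the VQC-based Q-functions.
    """
    td_total, penalty_total = 0.0, 0.0
    for s, a, r, s_next, done in batch:
        q_s = q_online(s)
        # Double Q-learning (assumed form): the online function selects the
        # next action, the target function evaluates it.
        next_online = q_online(s_next)
        best_next = max(range(len(next_online)), key=next_online.__getitem__)
        bootstrap = 0.0 if done else gamma * q_target(s_next)[best_next]
        td_total += (q_s[a] - (r + bootstrap)) ** 2
        # Conservative penalty (assumed CQL-style): lower Q-values overall
        # while keeping the Q-value of the action seen in the data high.
        penalty_total += logsumexp(q_s) - q_s[a]
    n = len(batch)
    return td_total / n + alpha * penalty_total / n


# Usage with a toy two-state, two-action table (illustrative values only).
table = {0: [1.0, 2.0], 1: [0.5, 0.0]}
q = lambda s: table[s]
batch = [(0, 1, 1.0, 1, False), (1, 0, 0.0, 0, True)]
loss = conservative_double_q_loss(q, q, batch, gamma=0.9, alpha=0.5)
```

Setting `alpha=0.0` in this sketch recovers a plain double Q-learning loss; larger values weight the conservative term more heavily. The penalty is never negative, since the log-sum-exp over all actions is at least as large as any single Q-value.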

Similar resources

Offline Evaluation of Online Reinforcement Learning Algorithms

In many real-world reinforcement learning problems, we have access to an existing dataset and would like to use it to evaluate various learning approaches. Typically, one would prefer not to deploy a fixed policy, but rather an algorithm that learns to improve its behavior as it gains more experience. Therefore, we seek to evaluate how a proposed algorithm learns in our environment, meaning we ...

Reinforcement Learning in Neural Networks: A Survey

In recent years, research on reinforcement learning (RL) has focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. Using neural networks enables RL to search for optimal policies more efficiently in several real-life applicat...

Generalized Quantum Reinforcement Learning with Quantum Technologies

We propose a protocol to perform generalized quantum reinforcement learning with quantum technologies. At variance with recent results on quantum reinforcement learning with superconducting circuits [L. Lamata, Sci. Rep. 7, 1609 (2017)], in our current protocol coherent feedback during the learning process is not required, enabling its implementation in a wide variety of quantum systems. We con...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2023

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v37i6.25872